Accelerated Optical Flow Function Algorithm Using Compute Unified Device Architecture
نویسندگان
چکیده
منابع مشابه
CUDASA: Compute Unified Device and Systems Architecture
ion Exposed Internal Built-ins application layer __sequence__ network layer __job__ __node__ jobIdx, jobDim bus layer __task__ __host__ taskIdx, taskDim GPU layer __global__ __device__ gridDim, blockIdx, blockDim, threadIdx Exposed functions are accessible from next higher abstraction Built-ins are automatically propagated to all underlying layers
متن کاملSolutions for Optimizing the Stream Compaction Algorithmic Function Using the Compute Unified Device Architecture
In this paper, I have researched and developed solutions for optimizing the stream compaction algorithmic function using the Compute Unified Device Architecture (CUDA). The stream compaction is a common parallel primitive, an essential building block for many data processing algorithms, whose optimization improves the performance of a wide class of parallel algorithms useful in data processing....
متن کاملComputation Optical Flow Using Pipeline Architecture
Accurate estimation of motion from time-varying imagery has been a popular problem in vision studies, This information can be used in segmentation, 3D motion and shape recovery, target tracking, and other problems in scene analysis and interpretation. We have presented a dynamic image model for estimating image motion from image sequences, and have shown how the solution can be obtained from a ...
متن کاملParallel Prefix Scan with Compute Unified Device Architecture (cuda)
Parallel prefix scan, also known as parallel prefix sum, is a building block for many parallel algorithms including polynomial evaluation, sorting and building data structures. This paper introduces prefix scan and also describes a step-bystep procedure to implement prefix scan efficiently with Compute Unified Device Architecture (CUDA). This paper starts with a basic naive algorithm and procee...
متن کاملImproving Software Performance in the Compute Unified Device Architecture
This paper analyzes several aspects regarding the improvement of software performance for applications written in the Compute Unified Device Architecture (CUDA). We address an issue of great importance when programming a CUDA application: the Graphics Processing Unit’s (GPU’s) memory management through transpose kernels. We also benchmark and evaluate the performance for progressively optimizin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Procedia Engineering
سال: 2012
ISSN: 1877-7058
DOI: 10.1016/j.proeng.2012.07.320